arXiv: 1504.05311v3 [cs.IT] 18 Sep 2015 


Transceiver Design for Clustered Wireless Sensor 
Networks — Towards SNR Maximization 


Yang Liu Jing Li, Xuanxuan Lu 

Electrical and Computer Engineering Department, Lehigh University, Bethlehem, PA 18015 
Email: {yal210@lehigh.edu, jingli@ece.lehigh.edu, xul311@lehigh.edu} 


Abstract —This paper investigates the transceiver design prob¬ 
lem in a noisy-sensing noisy-transmission multi-inpnt multi- 
outpnt (MIMO) wireless sensor network. Consider a clnster- 
based network, where multiple sensors scattering across several 
clnsters will first send their noisy observations to their respective 
clnster-heads (CH), who will then forward the data to one 
common fnsion center (FC). The cluster-heads and the fusion 
center collectively form a coherent-sum multiple access channel 
(MAC) that is affected by fading and additive noise. Our goal 
is to jointly design the linear transceivers at the CHs and the 
FC to maximize the slgnal-to-noise ratio (SNR) of the recovered 
signal. We develop three iterative block coordinated ascent (BCA) 
algorithms: 2-block BCA based on semidefinite relaxation (SDR) 
and rank rednction via randomization or solving linear equations, 
2-block BCA based on iterative second-order cone programming 
(SOCP), and multi-block BCA that lends Itself to efficient closed- 
form solutions in specific but important scenarios. We show 
that all of these methods optimize SNR very well bnt each has 
different efficiency characteristics that are tailored for different 
network setups. Convergence analysis is carried ont and extensive 
nnmerical results are presented to confirm our findings. 

I. Introduction 

Wireless sensor networks (WSN) are known to have a broad 
range of applications such as environmental monitoring, battle¬ 
field surveillance and space exploration H]. A typical wireless 
sensor network is composed of numerous sensors with each 
sensor having limited on-board processing and communication 
capability. The sensors are usually randomly casted over a 
large field in space to sense the same physical event. In 
practice the sensing observations tend to be disturbed due to 
the noise from hardware device or environment. The noisy 
observations are collected by the fusion centers (FC), where 
data fusion and further processing will be performed. The 
large amount of sensors are usually grouped into clusters, with 
each cluster formed by sensors located closely within a small 
neighborhood. Within each cluster, a cluser-head(CH) collects 
observations from other sensors free of error(this is reasonable 
since the cluster members lie in close vicinity) and then 
transmit the collected data to FC. Assume that HCs and FC are 
equipped with multiple antennas and linear transceivers, then 
a central question here is how to jointly design the transceivers 
to transmit the data reliably. This falls in the general problem 
of multi-input multi-output (MIMO) transceiver(beamforming) 
design problem, which has aroused a flurry of interest in recent 
years El-ITOl. 
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Fig. 1: Models for Cluster Based WSN 


We introduce the system model shown in Fig. [T] to capture 
the key characteristics of the afore-mentioned practical sce¬ 
nario. The event of interest 9 is modeled as a complex scalar 
and sensed by a total of sensors (cluster-members) 

coming from L clusters. These sensors communicate their 
imperfect sensing results to their respective cluster-heads. 
Within each cluster i, the observation distortion of the Ki 
sensors are collectively modeled as an additive noise vector rii 
(of size Ki), which may have any distribution (not necessarily 
Gaussian). The cluster-head i performs linear precoding to 
data before transmission with Ni antennas. We assume that 
each cluster-head must conform to an individual transmission 
power constraint Pi, in accordance to its specific power supply 
or battery life. It should be noted that, compared to a single 
overall power constraint for all the cluster-heads, the set of 
separate power constraints makes the model not only more 
practical, but also considerably harder. The MIMO coherent- 
sum multiple access channel (MAC) is considered for com¬ 
munications between cluster-heads and fusion center, which 
achieves a high bandwidth efficiency. The fading channel 
between the )-th cluster-head and the fusion center is denoted 
as Hi. At the fusion center, the received signal is corrupted 
by additive noise np (of size M). The fusion center performs 
linear postcoding to deduce an estimate 9. Our goal is 
to jointly design the transceivers and g to achieve a best 
estimate 9. 

The line of beamforming design in WSN research has 
actually received significant attention in recent years. A good 
variety of system setups have been investigated in the literature 


























m-iini. Compared to these existing literature, our system 
model is generally more generic and complicated. 

For example, 12 considers the case where each cluster 
has only one sensor (i.e. Ki = 1, Vi) and all cluster-heads 
and fusion center are equipped with one single antenna (i.e. 
Ni = 1, Vr and M = 1). Under the total power constraint 
over all the cluster-heads, the transceiver design problem boils 
down to a power allocation problem. 121 extends the scalar 
fading channel in 121 to square nonsingular matrix channels, 
i.e. Ni = M, Vi = I,-- - ,L. Still all clusters share one 
total power constraint. Note that the total power constraint 
over different clusters are usually nonrealistic since in practice 
different cluster-heads can be far away from each other without 
wired connections and are powered separately by build-in 
batteries. H extends the research by employing separate 
power constraints for each cluster-head, but still restricts the 
channel matrices to be square and nonsingular. Block 
coordinate descent(BCD) algorithm is obtained in a to solve 
the problem. 

Several studies are particularly noteworthy a-m. a 
is the first to propose a very general wireless sensor net¬ 
work model, where separate power constraints are employed 
and each fading channel matrix can have arbitrary 
dimension(iVi > M, Ni < M or Ni = M). a studies 
solutions to several special but also rather meaningful cases 
including scalar channels, fading but noiseless channels and 
nonfading but noisy channels. a focuses on the scalar target 
source with nonscalar fading and noisy channels and ob¬ 
tains an approximate BCD algorithm with fully closed form 
solution for each step. The work Q, 0 develop variant 
BCD algorithms to tackle the most general case proposed 
in a, with convergence and closed form solutions being 
examined in 0. All the above works adopt mean square 
error(MSE) as performance metric. Recently joint transceiver 
design problems aiming mutual information(MI) maximization 
are studied in 0, Eol, where orthogonal and coherent-sumq 
MAC are considered respectively. 

The primary interest of this paper is to solve the joint 
transceiver design problem for the system depicted in Fig. [T] 
The previous studies in ||5l- lfT0l all used MSE or Ml as the 
design criterion, and many of them assume a single power 
constraint (which renders an easier problem than separate 
power constraints). Here we take the signal-to-noise ratio 
(SNR) at the output of the EC postcoder g as a figure of merit. 
The SNR is an important performance indicator especially 
for discrete sources (where MSE becomes less meaningful). 
Maximizing SNR is equivalent to maximizing the symbol error 
rate (SER) in the discrete-source detection; and in the special 
case of Gaussian signaling, maximizing SNR is also equivalent 
to maximizing the channel capacity. The joint optimization 
problem is first formulated. Since it is non-convex, and since 
changing the design criterion (e.g. SNR instead of MSE) 
leads to a drastically different target function, the previous 
approaches and results can not be readily applied. Instead, 
we develop three feasible methods, all of which stem from 
the celebrated block coordinated ascent (BCA) principles, but 


each incorporates different techniques specifically tailored to 
the problem at hand, and renders computational strength in 
different scenarios. 

The rest of this paper is organized as follows. The prob¬ 
lem is formulated is Section m The optimal linear receiver 
is obtained in Section inD Two different realizations of 2- 
block coordinated ascent (BCA) algorithm are discussed in 
Section IIVI with their convergence being carefully examined. 
Section lY] proposes a multiple-block coordinate ascent ap¬ 
proach, which promises tremendous reduction in complexity 
for special system settings. Extensive simulations are carried 
out to verify the effectiveness of the proposed algorithms in 
Section |Vl] and Section I VII I concludes the whole paper. 

Notations: We use bold lowercase letters to denote complex 
vectors and bold capital letters to denote complex matrices. 
0, Ojnxn, and 1^ are used to denote zero vectors, zero 
matrices of dimension m x n, and identity matrices of order 
m respectively. A^, A* and are used to denote trans¬ 
pose, conjugate and conjugate transpose(Hermitian transpose) 
respectively of an arbitrary complex matrix A. Tr{ } denotes 
the trace operation of a square matrix. | • | denotes the modulus 
of a complex scalar, and || • ||2 denotes the Z 2 -norm of a 
complex vector, vec(-) means vectorization operation of a 
matrix, which is performed by packing the columns of a matrix 
into a long one column, (g) denotes the Kronecker product. 
Diag{Ai,-- - ,A„} denotes the block diagonal matrix with 
its i-th diagonal block being the square complex matrix A^, 
i G fti}. Re{x} and lm{r:} denote the real and 

imaginary part of a complex value x, respectively. 

II. System Model 

We first discuss of the system model shown in Eig[T] 
and present a mathematical formulation of the optimization 
problem at hand. 

The system is composed of sensors with Ki 

sensors belonging to the ith cluster, L sensor-heads each 
equipped with Ni transmitting antennas, and a fusion center 
equipped with M receiving antennas. All sensors observe a 
common unknown source 9 G C, and send their observation 
to their respective cluster-heads. Without loss of generality, we 
assume 9 has zero mean and unit variance, i.e. E{|0p} = 1. 
The sensing noise and transmission noise (to the ith cluster- 
head) can be collectively modeled as an additive noise rii, 
such that the ith cluster-head observes: 

Xi = lKi9 + ni, i=l,---,L, (1) 

where 1^^ is a vector of dimension KiXl with all the entries 
being 1, and Ui denotes the additive noise vector with zero 
mean and covariance matrix E{ninp} = where S, ^ 0. 
Eollowing the convention of WSN, we assume that rii’s for 
the different clusters are mutually uncorrelated. 

The noisy observation is beamformed by a linear pre¬ 
coder Fi G at each cluster-head before being sent to 

the fusion center. Let G be the MIMO channel 

state information (CSI) from the ith cluster-head to the fusion 


center. Suppose all the cluster-heads form a coherent-sum 
MAC channel; then the fusion center receives signal r; 
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+ no 

(2) 

1 —1 

L 

L 


E' 

2=1 

^ ^ H^F^n^ + no^, 

2=1 

(3) 


where ng is the additive noise at the fusion center. Without 
loss of generality, we assume ng has zero mean and white 
covariance matrix: E{ngn^} = ctqIm- Since the fusion center 
is usually far away from the sensing field, rig is assumed 
uncorrelated with rii’s. 

In practice, it is highly likely that each cluster-head is 
provisioned with a different power supply and hence must 
observe a different transmission power constraint. The average 
transmission power for the i-th cluster-head is ETr{Fi{lKid + 
n,){lK,0 + w)^Ff} = Tr{F,(lK,lf, + Sz)Ff}, which 
must be no greater than a power limit Pi. 

The fusion center uses a linear postcoder g to perform data 
fusion and obtain an estimate 9: 

L L 

= (g" ^ H,Faif.) 0 + ^ H,F, n,+ng) . (4) 


9=g"r=l 


The merit of the recovered signal 9 can be evaluated from 
several different perspectives. When the source 9 takes value 
from a continuous space, the most popular metric is the 
mean square error, defined as MSE = E{\9 — 9\'^}. MSE- 
targeted optimization has also been extensively studied in the 
beamforming literature (e.g. l2l-|[8l). Instead of taking MSE, 
here we target the average SNR, another important metric 
widely used in the design of communication systems. When 
the source 9 is taken from from a finite discrete alphabet set 
(e.g. M-PAM or M-QAM), the detection accuracy is usually 
measured by the symbol error probability taking the form of 
SER Ri ciQ(Vc 2 SNR), with ci and C 2 being some positive 
constants. When the source 9 has Gaussian distribution, the 
system throughput is usually measured by the mutual informa¬ 
tion between 9 and its estimate 9, 1(0, 9) = ^ log 2 (l + SNR). 
Hence, maximizing the SNR automatically minimizes the SER 
(for discrete source) or maximizes the mutual information (for 
Gaussian source). 

Erom dUl, the signal obtained by the fusion center (after 
linear postcoder g) is composed of a signal component and a 
noise component, and the average SNR can be calculated and 
simplified to 
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Hence, the joint transceiver design problem maximizing 
SNR for the clustered wireless sensor network can be for¬ 
mulated as 

(PO) : max. SNR({F}f^i,g), (7a) 

s.t. Tr{F,(lK,lf.+SOFf}<P^A = l,--- ,L. (7b) 

The optimization problem (PO) is nonconvex, as can be 
easily examined by checking the special case of scalar 
transceivers. In what follows, we will exploit the general 
principle of block coordinate ascent method to solve (PO) in 
an iterative manner. 

HI. Optimal Linear Receiver 

In this section, we present the optimal linear receiver g 
which leads to the maximal SNR. The main result is as 
follows: 

Theorem 1. For any predefined SNR is maximized 

if and only if g* has the following form 

/ L \ —1 L 

g*=a(a2lM+^H,F,S,FfHf J (^H^F^Ia,), (8) 

i=l ^ i=l 

where a is arbitrary nonzero complex scalar. The maximal 
SNR is given as 
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SNR* 


.-i L 


^lM+^H,F,S,FfHfj (^H,F,1 a.) 


(9) 


Proof: For simplicity we introduce the following nota¬ 


tions 


L 

M^a^lM+5]H,F,S,FfH 


(10a) 


(10b) 


With all sensors’ beamformers {F^}^]^ given, the SNR max¬ 
imization problem is the following optimization problem 

gffhh^g 

max.—j7——. (11) 

g#o g^Mg 

Since M 0, define g = M^g. The above problem becomes 

g^M“ 3 hh^M“ 5 g 


max .- 

§7^0 




( 12 ) 


SNR 


({F} 


=i,g) = 


EjKg^'EtiH.Fa, 

( T,Li H*F,nj4-ng 


[ Eti H^F^IaJ [ EU 


1 A 


g 


(5) 


( 6 ) 


'^o^llglli + Etig'^H.F.S.FfHfg 
where the assumption of uncorrelated noise across the fusion 
center and different cluster-heads has been invoked. 


From variational perspective, the maximal value of the above 
fractional is obtained if and only if g is aligned with eigen¬ 
vector corresponding to the maximal eigenvalue of the matrix 
IItTI . Notice that matrix is 

rank-one and has only one positive eigenvalue whose eigen¬ 
vector is Q;M~ 2 h, with a being any nonzero complex value. 
Thus the optimal solution of the above problem is g* = 
aM~ 2 h, from which ® and (|9|l can be readily obtained. 

■ 

In practice the factor a can be chosen as 1 for convenience. 















IV. Jointly Optimizing Beamformers at Sensors 


After obtaining the optimal linear receiver g, we focus on 
optimizing precoders at the sensors’ side in this section. 

First by utilizing the identities TrjAB} = Tr{BA} 
and Tr{ABCD} = vec^(D) [C^ (g) A] vec(B) HT], the 
numerator of SNR in ® can be rewritten as follows 


r L 




n H 




-i=i 


^ Tr{(Hfgg^H,)F,(lK, . l^jFf} 


(13) 


2j' = l 
L 


= ^vec^(F,) 
ij'=l 


(lx^.l^J®(Hfgg^H,) vec(F,). (14) 


Similarly the denominator of SNR can be written as 

L 


^o1g|l2 + Eg''H.F,S,FfHfg 

ETr{(Hfgg^H,)F,S,Ff}+a2||g| 


(15) 


= vec 


Hi 


F^) 


s: ® (Hfgg^H,) vec(F,)+a2||g||2, (16) 


A. Solving (PI) by Semidefinite Relaxation 

In this subsection we solve problem (PI) with help of recent 
progress in semidefinite relaxation techniques. 

First we rewrite the quadratic terms f^Af, f^Bf and 
f^Dif in (PI) into inner-product forms Tr{AX}, Tr{BX} 
and TrjDiX} respectively by introducing an intermediate 
variable X = fF^. Omitting this rank-one constraint, a 
relaxation of (PI) is obtained as follows 


(P2) : max. 

X 


Tr{AX} 
Tr{BX} +co’ 


s.t. Tr{DiX} < Pi, 


X)pO. 


i G {1, • • • , L}, 


(20a) 

(20b) 

(20c) 


The fractional semidefinite programming (SDP) objective 
in (P2) is still nonconvex. To solve it, we utilize Charnes- 
Cooper’s approach, which was originally proposed in im and 
subsequently adopted in many fractional SDP optimization 
problems like ifT^l ITSl . This turns (P2) into the following 


SDP problem: 

(P3) : max. Tr{AY}, (21a) 

s.t. Tr{BY} -I- = 1, (21b) 

Tr{-D,Y} <P,o, ie {!,■■■ ,L} (21c) 

Y:^0,z/>0. (21d) 


and the i-th power constraint is expressed as 



(17) 

vec(Fi) < Pi. 


Here we introduce the following notations 


The equivalence between (P2) and (P3) is established by the 
following lemma. 


Lemma 1. The problem (P2) and (PS) have equal optimal 


values. If X* solves (P2), then (^ Tr{BX*+co} ^ Tr{BX*+co} , 
is an optimal solution to (PS). Conversely, if (Y*,v*) solves 
(PS), then V* > 0 and Y* jv* solves (P2). 


fi = vec(Fi), ,L; 



(IatIxJ ® (Hf gg^Hj) , = 


S; ® (Hf gg^H,)' 


1, * * * , -£/, 



, z = 1, • • • , L; 

co = a2||g||^. 




(18a) 

(18b) 

(18c) 

(18d) 

(18e) 


and define the matrix A = (i-S- the (*) 7 )-th 

elementary block of A is Ay), B = Diag{Bi,--- ,Bi} 
and D, = Diag{Oj,i-i , Q, } and pack 

all fi’s into one vector f = [f^^, • • • , fj]^. Then the problem of 
optimizing beamformers {Fi}b^^ with g given is reformulated 
as follows 


(PI) : max. 

f 

s.t. 


f^Af 
f^Bf + Co ’ 
f^D,f < Py 


I G {1, • • • , L}. 


(19a) 

(19b) 


In the following we discuss methods solving the problem 
(PI). 


Proof: See appendix lAl ■ 

Since (P3) is SDP problem, it can be solved by standard 
numerical solvers like CVX ||25]| . Remember that our goal is to 
solve problem (PI). If the optimal solution Y* to (P3) is rank- 
one, then the relaxation (P2) is tight to (PI) and the optimal 
solution of (PI) can be obtained by eigenvalue decomposition 
of Y* jv-k. When the optimal solution Y* has rank larger than 
one, constructing a solution to (PI) from (Y*, v*) still needs 
to be addressed. 

To introduce our first major conclusion we need the follow¬ 
ing lemma. 

Lemma 2. The problem (PS) and its dual are both solvable. 

Proof: See appendix IbI ■ 

For wireless sensor network with small number sensor 
clusters, we have the following conclusion. 

Theorem 2. If the wireless sensor network has no more than 
3 sensor clusters, i.e. L < 3, then the relaxation (P2) is tight 
with respect to (PI). An optimal solution (Y*,v*) to (PS) 
with Y* being rank-one can be constructed and solution to 
(PI) can be obtained by eigenvalue-decomposing Y*/v*. 


















Proof: The proof is inspired by theorem 3.2 of na.ifY* 
has rank one, nothing needs to be proved. Otherwise since the 
problem (P3) and its dual (D3) are both solvable by lemma|2] 
theorem 3.2 of ifTSll is valid to invoke. Dehne r = rank(Y*) 
and perform the following procedure: 

- While rank^(Y*) + rank(j/*) > L + 1 Do 

Step-1: Perform a full rank decomposition Y* = VV^, 
where V G 

Step-2: Find a nonzero pair (A, 6), where A is a r x r 
Hermitian matrix and <5 is real scalar, such that 
the following linear equations are satisfied 

Tr{V^BVA}-fcoi^*^ = 0; (22) 

Tr{V^D,VA}-P,i/*^ = 0,i = l,--- ,L; (23) 

Step-3: Evaluate «; = max ^|Ami„(A)|, |Amax(A)|, |(5|^; 

Step-4: Update Y* = - k"! A) V^, v* = 

v*{\ — k~^5) and r = rank(Y*); 

- End While 

In fact (P3) has two semidehnite variables Y and z/(note that 
V is actually a nonnegative real scalar) and L + 1 constraints. 
As long as the condition rank^(Y*) -f rank(z/)* > L -f 1 
holds, nonzero solutions to (l22ll and (l2Tl l exist. Thus after 
each repetition a new optimal solution is constructed with 
rank(Y*) being reduced by at least 1. Finally we obtain 
rank^(Y*) -f rank(z/)* < L + 1. Recall that i/* > 0 by lemma 
[T] so rank(z/)* = 1 and we have rank^(Y*) < L < 3. So 
rank(Y*) = 1 and the theorem is proved. ■ 

In the above, we have seen that (PI) can be tackled by 
solving a SDP problem and then a finite number of linear 
equations when L < 3. However the assumption that L < 3 
is still very stringent since in practice a sensor network can 
usually be composed of numerous clusters. A method to 
solve (PI) suitable for arbitrary L is still desirable. In the 
sequel, we proceed to discuss randomization method inspired 
by the recent literature El. Before going into details, first we 
modify the problem (P3) a little bit. By changing the equality 
constraint (I21bl i into inequality, we have another SDP problem 


(P4) as follows 

(P4) : nmx. Tr{AY}, (24a) 

s.t. TrjBY} + cqv < 1, (24b) 

Tr{D,Y} < i e {1, • • • , L} (24c) 
Y)pQ,v>Q. (24d) 


We assert that (P3) and (P4) are equivalent and for any 
solution {Y*,v*) to (P4), v* must be positive. In fact since 
(P4) is a relaxation of (P3), opt(P4) > opt(P3). Conversely, 
if (Y*, V*) is an optimal solution to (P4), then the constraint 
(I24bb must indeed be active. Otherwise, Y* and v* could 
be simultaneously inflated with a factor p > 1 such that 
{pY*, pi/*) satisfies all constraints of (P4) with (I24bb being 
active and gives an strictly larger objective, which contradicts 
the optimality of (Y*, i/*). So {Y*,i/*) is feasible for (P3) and 
thus opt(P4) < opt(P3). Consequently (P3) and (P4) have 


equal optimal value. This means solution to either problem 
also solves the other one. Thus any solution (Y*, i/*) to (P4) 
is also a solution to (P3) and by lemma [T] i/* > 0. 

Assuming that we have obtained an optimal solution 
(Y*, i/*) to (P4), we now generate a sufficiently large number 
of independent complex random variables following the Gaus¬ 
sian distribution CY(0, Y*). The intuition behind randomiza¬ 
tion comes from the observation of the following stochastic 
optimization problem 

(P5) : max. Ef,...eN(o,Y){f^Af}, (25a) 

s.t. Y) {f^Bf} +0)1/ < 1, (25b) 

Efo..eJt(o,Y){f^Dif} i = 1, • • • ,L, (25c) 
i/>0. (25d) 


By utilizing the relation Ef.^e 3 .^(o^Y){ff^} = Y, the stochastic 
problem (P5) actually becomes the SDP problem (P4). The 
random variable f ~ CY(0, Y*) solves the problem (P4) in 
expectation. Thus if we have a sufficiently large number of 
samples, the “best” sample should solve the problem. 

The “best” sample can be found as follows. First, note that 
random samples are not always feasible for (P4). This issue 
can be addressed by the following rescaling procedure. For 
each sample f, we define the scaling factor /3(f) as 


/3(f) 


min. 

i—l,--- ,L 


1 - CqZ/* / Pjl/* \ 

f^Bf ’ y f^D,f J ’ 


(26) 


and rescale the sample f as 



(27) 


It is easy to check that the obtained f is guaranteed to be 
feasible for (P4). Thus by performing the above rescaling 
procedure we can obtain a large number of feasible samples 
to approximate the optimal solution Y*. Then we choose the 
one giving maximal objective value as solution to the problem 
(PI). When the number of samples is sufficiently large, the 
obtained best objective value of (P4) by rescaled random 
samples can be extremely close to true optimal value of (P4) 
so the randomization solution can be regarded as tight to the 
original problem (PI). 

In retrospect to the previous discussion, the motivation of 
transforming the problem (P3) into its equivalent (P4) now 
becomes clear. For implementation there is no chance that the 
randomly generated samples will satisfy the equality constraint 
(1^ . At the same time the positivity of i/* guarantees that 
the rescaling in dZTl l can be performed. 

Up to here, we have actually come out an alternative 
maximization method to solve the SNR optimization problem 
(PO) in 0. The algorithm starts from a random feasible point. 
In each iteration g is optimized in a closed form by theorem 
[T] with {Fi}f^^ being fixed and {Fi}f^^ are optimized by 
solving (P3) followed by randomization-rescaling or solving 
linear equations with g given. 

This algorithm is summarized in algorithm [T] as follows. 







Algorithm 1: 2-Block BCA to solve (PO) using SDR and 
randomization) 

1 Initialization; Randomly generate nonzero feasible 

such that obtained by theorem [ 1 ] is also 
nonzero; j = 0 ; 

2 repeat 

3 Solve (P3) and obtain (Y*, 

4 if L < 3 then 

5 Reduce rank of Y* as in Theorem |2j Obtain 
i-Ti ji=l> 

6 else 

7 Generate sufficiently large number of samples 
following C?vr(0, Y*); 

8 Rescale each sample by (l26l l and (l27l l: 

9 Select among all rescaled samples the one giving 

maximal SNR as 

10 end 

11 Update by theorem [T] j -f -f; 

12 until the increase of SNR becomes sufficiently small or a 
predefined number of iterations is reached'. 


B. Iteratively Solving (PI) 


only if the following optimization problem (Pb^) 

(P 67 ) : min . u (31a) 

f ,n >0 

s.t. f^Af > 7 f^Bf -P 7C0, (31b) 

f^D f 

— ,L}. (31c) 

has optimal value smaller than 1 , i.e. opt(P 6 -y) < 1. 

Next we show that all constraints of problem (P 6 -y) can be 
written in a second order cone form. The constraint (I31bl) . 
utilizing the result of lemma 0 can be written as 

7f^Bf+ 7C0 < |a"f|2 (32) 

Another key observation is that the optimal f* to (P 6 -y) is 
phase invariant—is optimal solution to (Pb..,,) if and 
only if (ej®f*,M*) is optimal for any real value 0. So without 
loss of optimality we assume that a^f = v with v being 
a nonnegative real value. Thus the constraint (l3^ readily 
becomes the second order cone ^/ 7 || [f^Bs, ,/co] < v. For 

the 7 -th power constraint in (I31cl) . it can also be written in a 

_1 j ^ 

second order cone form P- ' ||Dj?^ f ||2 < u. Thus the problem 
(Pb^) can be equivalently written in a standard second-order 
cone programming (SOCP) form: 


In the last subsection, we solve the problem (PI) with help 
of semidefinite relaxation by first solving its SDP relaxation 
and than construct the rank-one solution through solving linear 
equations or randomization method. In this subsection, we 
propose an alternative method which to solve (PI) in an 
iterative manner. First we have the following conclusion 

Lemma 3. Matrix A in (PI) is rank-one. Specifically A = 
aa^ with the vector a being given as 


min . u 

f , 22 , 2 ; 
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(33a) 

s.t. 

0 ^ V 

0 

PfcO _ 

■ f ■ 

1 

< 

2 

(33b) 

Re{a^f} = V, 




(33c) 

lm{a^f} = 0 , 




(33d) 


/^Dff 

< 77, 

= 1 ,- 

•• ,L. 

(33e) 


Iki ® H^g 
Ikl ® H^g 


(28) 


Proof: See appendix O ■ 

Now looking at the fractional SDP objective of (PI) we 
have the following observation. For any given nonnegative real 
value 7, the SNR is no smaller than 7 is equivalent to the fact 

f^Af > 7 f^Bf -P 7Co. (29) 

In other words, if opt(Pl) > 7, then there exits some f such 
that the inequality (1291 ) and all power constraints f^D^f < Pi 
for 7 = 1, • • • , L are simultaneously satisfied. If we define u 
as follows 


77 = max. 

2=1,••• ,L 


J’ 


(30) 


then the fact that all power constraints are satisfied is equiv¬ 
alent to 77 < 1. Thus the statement opt(Pl) > 7 holds if and 


Thus if we know that the opt(Pl) lives in some interval, then 
opt(Pl) can be determined by a bisection search—we set 7 as 
middle point of the current search interval, if (opt(P7.y) < 1), 
then opt(Pl) can achieve higher value and 7 is a lower bound 
of opt(Pl). Otherwise 7 upper-bounds opt(Pl). 

Now the remaining problem is to determine an interval 
containing opt(Pl), from which the bisection search can 
start with. Since (PI) is maximization problem, any feasible 
solution gives a lower bound of opt(Pl). The following lemma 
provides an upper bound of opt(Pl). 

Lemma 4. Optimal value of (PI) has an upper bound as 
follows 


Proof: See appendix iDl ■ 

Thus we have obtained an alternative method to solve the 
original problem (PO), which also falls in the 2-block BCA 
framework. The steps are summarized in Algorithmic 





























Algorithm 2: 2-Block BCA to solve (PO) based on SOCP 


1 Initialization: Randomly generate nonzero feasible 

such that obtained by Theorem [T] is also 


nonzero; j = 0; 

2 repeat 


3 

4 

5 

6 

7 

8 
9 

10 

11 

12 


Obtain bdi = SNR({fP}^^, and bdu by (O; 

repeat 

Set 7 = {bdu + bdi)/2; solve (PT^,); 
if opt(P77) < 1 then 
I bdi = 7 ; 

else 

I bdu = 7 ; 

end 

nntil {bdu — bdi) is small enough, 

7 = bdi; 


13 Solve (P7^) to update {Fp'''^^}^^; 

14 Update g(2+i) by theorem [T] j + -P; 

15 nntil the increase of SNR is sufficiently small or a 
predefined number of iterations is reached; 


C. Convergence and Complexity Analysis 

The two 2-block BCA algorithms developed in the previous 
subsections have the following convergence property: 

Theorem 3. The sequence of SNR obtained by algorithm Q] 
or^converges. Moreover the solution sequence generated by 
algorithm Q] or |2] has limit points and each limit point is a 
stationary point of problem (PO). 

Proof: See appendix |E] ■ 

The complexity of the proposed algorithms is complicated 
since the whole network has too many factors (ATi’s and Nfs) 
that impact the problem size. To simplify the analysis, we 
consider homogeneous sensor networks, where each cluster 
has the same number of sensors and each cluster-head has the 
same the number of antennas, i.e. Ki = K and Ni = N for 
all i = 1, • • • ,L. 

Using the primal-dual interior point method ll24l . we can 
show that the complexity to solve (P4) is 
The complexity to update g by ([8]l comes primarily from the 
matrix inversion operation, which has a complexity O(M^). 
In general, a few thousand of random samples are sufficient 
to guarantee a satisfying tightness of the obtained rank- 
reduced solutions (usually within 10“^ from the true opti¬ 
mal value), and the required number of samples does not 
increase with the network size. Thus, the complexity for each 
outer-layer iteration of the SDP-based 2BCA algorithm is 

From II 23 I . the complexity for solving the SOCP problem 
(P7.y) is Recall that each round of bisec¬ 

tion search solves (P7.y) once, so (P7.y) is solved multiple 
times within one outer-layer iteration. Taking different channel 
conditions and levels of predefined precision into account, 
numerical results show that the number of times solving (P7.y) 


varies between the narrow range [25,35] and thus can be 
considered as a constant. Thus the complexity of outer-layer 
SOCP-based 2BCA algorithm is 0{L^-^K^N^ + M^). 


V. Multiple Block Framework to Maximize SNR 


In the previous sections, the proposed algorithms are both 
2-block coordinate ascent methods where all the beamformers’ 
sensors are jointly updated. One problem for these algorithms 
is that the complexity of solving the associated SDP or SOCP 
problem grows intensively with the increase of the size of 
the wireless sensor network. Instead of jointly optimizing all 
beamformers, we can alternatively focus on just one sensor’s 
beamformer each time. This actually results in a multiple- 
block BCA approach, which, despite the many subproblems 
involves, often involves a lower complexity (see complexity 
analysis and numerical results). Specifically, for the case of 
Ki = 1 {\ sensor in the i-th cluster), the solution to the 
ith block is quickly obtained in a closed form, requiring no 
numerical solvers. 

Now we consider the problem of optimizing the i-th beam- 
former Fi with g and {Fj}j^i being fixed. By introducing 
the following notations 


Qi — j Ci— ^ fj- Ajkfk; 
di = 


this problem is formulated as follows 

f/^Aiifi-|-2Re{qf fj -f a 


(PIU : max. 

fi 




s.t. f/^Cifi < Pi- 


(35a) 

(35b) 


(36a) 

(36b) 


A. One-Shot SDR-Rank-Reduction Method 

First we introduce a one-shot method to solve (PI*), which 
performs semidefinite programming and rank-one matrix de¬ 
composition in tandem. This method is discussed in recent 
work ca and ini. 

By use of Charnes-Cooper’s transformation and rank-one 
relaxation we turn (PI*) into the following relaxed version 


max. TrjQiZ}, 

Z,r; * ■’ 

(37a) 

s.t. Tr{Q 2 Z} = 1, 

(37b) 

TrjQaZ} < P.p, 

(37c) 

Tr{Q4Z} = p, 

(37d) 

Z 0,77 > 0. 

(37e) 


with parameter matrices being defined as 


Qi = 


Q3 = 




qr Ci 

c, 0 

0"^ 0 


.Q2 = 


,Q4 = 


B, 0 

_ 0^ d^ 

O 0 

0^ 1 


(38a) 

(38b) 


Solving the SDP (P7*) we obtain an solution (Z*,? 7 *). If 
the Z* is rank one, i.e. ^ = z*z*^ with z* = [z^, 2 ; 2 ]^j 















then z|/z 2 is an solution to (PI*) and the relaxation (P7*) 
is actually tight with respect to (PI*). Actually the rank-one 
solution Z* always exits due to the recent matrix decompo¬ 
sition result in na. In fact if Z* has rank larger than one, 
by help of theorem 2.2 in m, we can obtain a vector z 
such that the equations Tr{(Qi — opt(P7*)Q2)zz^} = 0, 
TrjQjZz^} = Tr{QjZ*} for j = 3,4. This means {zz^,r]*) 
is rank-one optimal solution to (P7*) and thus (PI*) can be 
solved. 


B. Iterative Method 

Besides the above one-shot method, here we propose an 
alternative iterative method to solve (PI*). As we will shortly 
see this iterative method can give birth to extremely efficient 
solution to (PI*) in specific circumstance. 

For any given positive real value a, the fact that the SNR 
objective in (PI*) is no smaller than a equivalently reads 

f/^[aBi-Aji]fi-2Re{qf fi}-f (adi-Ci) < 0. (39) 

This immediately implies that if the following problem with 

(P8^) : min. f/^[aB—A^i]fi-2Re{qffi}-|-(adi-Ci), (40a) 

s.t. ff C,f, < P,. (40b) 

with a given has a nonnegative optimal value then opt (PI*) > 
a. Otherwise a can serve as an upper bound of opt(Pl*). Thus 
we can perform a bisection search to solve (PI*). Now the 
problem reduces to how to solve the problem (P8(j)? Note that 
the quadratic matrix [aB—A^^] can be negative semidefinite 
or indefinite and thus (PS),,) is possibly nonconvex. The 
following theorem convinces us that (P8),) can always be 
solved regardless of the convexity of its objective. 


Theorem 4. If the i-th sensor cluster has more than one sensor 
or the head is equipped with multiple antenna, i.e. Ki > 2 or 
Ni > 2, the problem (P8^) can be solved. 

Proof: See appendix |F] ■ 

Although theorem |4] shows that the problem (P8),) can 
be solved by SDR and thus the iterative method to solve 
(PI*) works, it is generally less efficient than the one-shot 
method discussed above. Since the former performs semidef¬ 
inite programming and rank-one reductions multiple times 
while the latter for just once. However in the circumstance 
where Ki = 1, the following theorem indicates that (P8),) 
has fully closed form solution and consequently the iterative 
method can become extremely efficient. 


Theorem 5. When Ki = 1, denote Sg and as scalars 

and {crfJfLi respectively. The solution f* to (P8^) is 


/*= 


/3*Hf g .r 2^1 \Pt\ 

l|Hf glll(Qcrj'-l) ’ ’ ||Hf g||2(a(T2-l) 

l/3i|||Hfg||2 




Otherwise, 


with Pi and Pi being defined as follows 




K, 


'Hfg 




P — 


(41) 


Proof: See appendix iGl ■ 

Note that the closed-form solution in Theorem |5] neither re¬ 
quires matrix decomposition or solving linear equations(matrix 
inversion) nor depends on numerical solver. Thus iteratively 
solving (PI*) is easy for implementation and has very low 
computation cost. Comparatively, the one-shot method for 
solving (PI*) depends on numerical solvers (like CVX) which 
are iterative-based (interior point method) solvers with each 
iteration performing matrix decomposition and solving linear 
equations. 

To start the bisection search, the latest SNR can serve as a 
lower bound for opt(Pl*). From (fSTl l. we can derive an upper 
bound for opt(Pl*): 

d-i(||Hfg||2p,A-)„(Q)+2||q.||2P/A-i(Q)+c.). (42) 

Note that the above upper bound can be much tighter than the 
one given in ( |34] | since it utilizes the knowledge of {Fj}j^i. 

From the above discussion, we can utilize multi-block 
BCA method to solve the original problem (PO). In each 
update, we tackle either one individual precoder (associated 
with one cluster) or the poster (associated with the FC). 
If the *-th cluster has Ki > 1 sensors, then its precoder 
can be updated by the one-shot SDR-rank-reduction method; 
Otherwise, Theorem |5] provides a clean closed-form solution. 
The entire approach is summarized in Algorithmic 


Algorithm 3: Multi-Block RCA to solve (PO) 

1 Initialization; Randomly generate nonzero feasible 

such that g(°) obtained by ® is nonzero; 

2 repeat 

3 

4 

5 


for 2 = 1,••• ,L; do 

if ATi > 1 then 

Solve (P7*); Then perform rank-reduction 
using Theorem 2.2 in llT4l : Then update Fi ; 

6 else 

7 Set bdi as current SNR; obtain bdu by (l42T i; 

8 repeat 

9 Set a = {bdu + bdi)/ 2', 

10 Solve (P8)j) by Theorem (|5]i; 

11 if opt(P8)j) < 0 then 

12 I bdi = a; 

13 else 

14 I bdu = ct', 

15 end 

16 until {bdu — bdi) is small enough; 

17 a = bdi; 

18 Solve (P8),) by Theorem |5j update F^; 

19 end 

20 Update g by theorem [T] ; 

21 end 

22 until the increase of SNR becomes sufficiently small or a 
predefined number of iterations is reached; 


Although the multiple BCA method generates monotoni- 
cally increasing SNR sequence, it is hard to prove that the 














limit points of its solution sequence guarantee to converge to 
stationary points of (PO). Numerical results in section IVll show 
that multiple BCA algorithm usually has a very satisfying 
convergence behaviors. 

By primal-dual inter point method 1241, the complexity of 
each outer-layer iteration of multiple BCA for homogeneous 
wireless sensor network is 0(^LK^-^N^-^+LM^). Particularly 
for homogeneous network with K = \, the complexity 
becomes O(LM^) with the help of theorem |5] 

VI. Numerical Results 

In this section, numerical results are presented to testify 
the proposed algorithms’ performance. In our experiments, 
the observation noise at each sensor is colored, which has 
a covariance 

S, = a2So, (43) 

where the Ki x Ki matrix Sq has the Toeplitz structure 


So = 


The parameter p is set to 0.5 for all sensors in the following 
experiments. Here we dehne the observation signal to noise 
ratio at the i-th sensor as SNR^ = a~^ and the channel signal 
to noise ratio as SNR = 

In hgure |2] and |3] the average SNR obtained at the FC are 
plotted. It is assumed that the sensor network has 5 sensors and 
FC has 4 antennas. We set 7Vi = 3, N 2 = 4, N 3 = 5, N 4 = 
A,N5 = 5, iTi = 3,K2 = 4,iT3 = 5,iT4 = 6,1^5 = 6 
and Pi = 0.2, P 2 = 0.2, P 3 = 0.3, P 4 = 0.2, Pg = 0.3. For 
each fixed channel SNR level, 50 random channel realizations 
are generated with each element of channel matrix follows 
circularly symmetric complex Gaussian distribution with zero 
mean and covariance 2. With channel SNR and channel 
matrices given, the proposed algorithms are performed starting 
from one common random initial. The obtained average SNR 
is plotted in figure |2] and [3 The obtained average SNR of 
SDR based and SOCP based 2BCA algorithms are plotted in 
figure in with respect to different outer iterations. The curve 
associated with random initials actually represents the perfor¬ 
mance of random feasible linear transmitters. From hgure |2] 
optimized SNR converges in 10 outer-iterations on average. 
These two algorithms have identical average convergence 
performance, this will also be verihed by hgure |4] The average 
SNR performance obtained by multiple BCA algorithm is 
presented in hgure [3l where SDR based 2BCA algorithm 
serves as a benchmark. Multiple BCA algorithm presents 
identical average SNR performance with the other 2 block 
algorithms. 

In hgure |4] and |5] the impact of different initial points to 
the algorithms are examined. The system setup is identical 
with the experiment in hgure |2] and |3] We set the channel 


1 

P 


P 

1 


„K-1 


(44) 


Average SNR Obtained by SDR Based 2BCA Algorithm and SOCP Based 2BCA Algorithm 



Fig. 2; Average SNR Obtained by SDR Based 2BCA Algo¬ 
rithm and SOCP Based 2BCA Algorithm 


Average SNR Obtained by SDR Based 2BCA Algorithm and Multiple BCA Algorithm 



Fig. 3; Average SNR Obtained by SDP Based 2BCA Algo¬ 
rithm and Multiple BCA Algorithm 


SNR as 2dB and hx the channel matrices with one specihc 
random realization. The three proposed algorithms are started 
from 10 different random initials and each SNR itinerary 
with respect to number of outer-layer iterations is plotted in 
hgure m and where the itineraries of SDR based 2BCA 
algorithm serve as benchmarks. From hgure 0] it can be 
seen that the two 2BCA algorithms have almost identical 
SNR itineraries. Comparatively, multiple BCA algorithm’s 
itineraries are usually very different but hnally it will converge 
to identical value. Figures 0] and |5] rehect the fact that; the 
proposed three algorithms are initial-insensitive; they hnally 
converge to identical SNR value; and usually 30 iterations are 


























Convergence Behaviors with Different Initial Points: SDR-Based 2BCA v.s. SOCP-Based 2BCA 



Fig. 4: Convergence with Different Initials; SDR Based 2BCA 
Algorithm v.s. SOCP Based 2BCA Algorithm 


Convergence Behaviors with Different Initial Points: SDR-Based 2BCA v.s. Multiple BCA 



Fig. 5: Convergence with Different Initials; SDR Based 2BCA 
Algorithm v.s. Multiple BCA Algorithm 


sufficient for these proposed algorithms to converge. 

Next we present numerical results for complexity. Still we 
take homogeneous wireless sensor network as example. N and 
M denote the number of antennas for each sensor or sensor 
cluster and FC respectively and take modest values within 
several tens. Comparatively the number of sensors or sensor 
clusters can be large, and one cluster can have numerous 
sensors. So we focus on the impact of L and K on the 
complexity. Figure |6] and |7] represent the complexity for each 
outer-layer iteration for proposed algorithms with respect to 
K and L respectively. Generally SDR based 2BCA algorithm 
has higher complexity than the two others. The SOCP based 



Fig. 6; Complexity of Algorithms with Respect to K 



Fig. 7; Complexity of Algorithms with Respect to L 


algorithm has lowest complexity for large K with small L and 
multiple BCA algorithm has the lowest complexity for large 
L. 

In the following the average execution time of proposed 
algorithms using MATLAB with the standard toolbox CVX 
v2.1 on the same computer are presented in Table HI and HIl 
The multiple BCA algorithm requires much lower time for 
networks with large L and SOCP based 2BCA algorithm 
is more efficient for large K and small L. Although the 
complexity of SDR-based 2BCA algorithm increases drasti¬ 
cally with the increase of K, N and L in general, it can 
still be useful in specific scenarios. Note that when the size 
of wireless sensor network is small, the execution time of 
SDR based 2BCA algorithm mainly comes from random 
































































TABLE I: MATLAB Running Time Per (Outer) Iteration(in sec.) 


Dim. 

Alg. 

L = 5 

L = 10 

II 

to 

0 

L = 30 

II 

0 

K= 1 

Alg 

1 


1.814 

3.561 

8.462 

18.58 

34.76 

Af = 4 

Alg 

2 


5.677 

9.163 

15.84 

23.14 

38.82 


Alg 

3 


0.067 

0.380 

2.603 

8.344 

19.38 

K = l 

Alg 

I 


2.175 

5.413 

21.97 

59.23 

148.5 

N = 8 

Alg 

2 


7.488 

12.21 

25.58 

51.19 

84.17 


Alg 

3 


0.073 

0.406 

2.741 

9.387 

19.27 

K = 3 

Alg 

I 


2.650 

9.002 

43.07 

158.5 

689.5 

Af = 4 

Alg 

2 


10.462 

21.40 

54.79 

111.9 

45.60 


Alg 

3 


1.106 

2.423 

6.549 

14.29 

26.95 

K = 3 

Alg 

I 


7.222 

32.32 

536.9 

— 

— 

N = 8 

Alg 

2 


19.765 

50.99 

173.6 

59.32 

85.81 


Alg 

3 


1.650 

3.519 

9.286 

18.19 

31.74 

K = 5 

Alg 

I 


4.468 

19.65 

160.3 

— 

— 

K = 4: 

Alg 

2 


14.944 

32.85 

125.8 

50.04 

69.08 


Alg 

3 


1.455 

2.989 

7.749 

16.63 

30.41 

K = 5 

Alg 

T 


16.442 

115.8 

— 

— 

— 

N = 8 

Alg 

2 


32.273 

121.0 

80.21 

134.2 

201.3 


Alg 

3 


2.662 

5.617 

13.51 

24.99 

42.33 


Note: "—" means the problem is too lai'ge to be solved. 

Alg.l: SDR-2BCA alg.; Alg.2: S0CP-2BCA alg.; Alg.3: multiple BCA alg. 


TABLE II: MATLAB Running Time Per (Outer) Iteration(in sec.) 


Dim. 

Alg. 

a:=2o 

0 

CO 

II 

0 

II 

L= 2 

N = W 
M = 3 

AlgljJ 

AlgE 

Alg[3] 

625.8 

89.47 

1.903 X 10® 
2.171x103 

5.378x103 

Note; ■ 

—" means the prob 

em IS too large 

to be solved. 


Alg.l: SDR-2BCA alg.; Alg.2: S0CP-2BCA alg.; Alg.3; multiple BCA alg. 


samples generation and rescaling. In the case where parallel 
computation is available, this procedure can requires very little 
time and thus competitive to the other two algorithms. 


If solves (P3), since v* > 0, it is easy to 

check Y*/i/* is feasible for (P2) and gives an objec¬ 
tive value of Tr{B^v/o+M " opt(P3)- So opt(P3) < 
opt(P2). On the other hand, if X* solves (P2), then 

( Tr|BX*+co} : Tr{BX^+co} ) Solution to (P3) and 

gives objective value of Tr{A y^|g^,^^ | } = opt(P2). So 
opt(P2) < opt(P3). The proof is complete. ■ 


B. Proof of Lemma \2\ 

Proof: Eirst we prove that (P3) is solvable. By (I21bl) we 
have 0 < 1 / < so is bounded. Combining (I21cl l we 
readily obtain Tr{DiY} < PiV < Pi/co, i € 
which implies Tr{(X;f^i Di)Y} < Ti)/co. Since 

= DiagjCi,--- ,Cl} >- 0, this means Y is 
bounded. So the feasible region of (P3) is bounded. Obviously 
the feasible region of (Y, i/) is also closed. So (P3) has 
compact feasible region. Since the objective Tr{AY} always 
takes finite values on the whole feasible region, by Weierstrass’ 
theorem(proposition 3.2.1-(1) in ifTSl ). (P3) is solvable. 

The Lagrangian function of problem (P3) is given as 


L(Y,i/,A,{/r,}ti) (45) 

L 

= Tr{ YA} + A (l-Tr{ YB }-coiy) - ^ (Tr{YD, 

i=l 

L L 

= Tr|^ I^A—AB — ^ HiDi Yj--I- ^ — cqA-I-^^ Piii^vpX. 


VIE Conclusion 

This paper considers the joint transceiver design problem 
in cluster based wireless sensor network. To maximize the 
output SNR at the fusion center, the difficult original problem 
is decomposed into two or more subproblems and solution to 
each subproblem is obtained. Convergence and complexity are 
carefully examined. Extensive numerical results show that the 
proposed algorithms provide equivalently good SNR values 
while have different efficiency characteristics and suitable for 
various system settings. As an extension of current problem, 
robust design and decentralized algorithms are desirable and 
meaningful for future study. 

Appendix 

A. Proof of Lemma Q] 

Proof: Assume that X* and {Y*,v*) are optimal solu¬ 
tions to (P2) and (P3) respectively, and opt(P2) and opt(P3) 
are optimal values of the two problems. 

Eirst we claim that v* > 0. This can be proved by contra¬ 
diction. If V* = 0, then we readily obtain Tr-fD^Y*} = 0, 
for i = I,-- - ,L. This leads to Tr{ D^jY*} = 0. 

Since it is assumed that 0, for i S it 

holds that 0, for i G i^}- Thus = 

DiagjCi,--- ,Cl} >- 0 and we obtain Y* = O. However 
this violates the constraint (I21bl i. since its left hand side equals 
zero. Thus ly* > 0. 


By taking the supremum of Lagrangian function with respect 
to Y 0 and z/ > 0, the dual function is obtained as 

= sup. C(Y,i/,A,{y 2 i}f^i) = A (46) 
y:^o.i/>o 

with the conditions [A —AB — Pi^i\ =4- 0 and ( —cqA-I- 
Si=l PiPi) ^ 0 satisfied. So the dual problem of (P3) can 
be given as 


(D3) : min. A 

(47a) 

L 


s.t. AB + ^ piDj A, 

(47b) 

2 —i 

L 


2—1 

(47c) 

> 0 , z = 1 , • • • , L. 

(47d) 


Next we prove that (D3) is solvable. To do this it is suf¬ 
ficient to show that there exists a real value 7 such that 
the level set {(A, {Ai*}fLi)|A < G dom(D3)} 

is nonempty and bounded, where dom(D3) means feasible 
region of (D3). Here we choose fli = /^miniCf) 

for i = I,- - • ,L, where \max{') and \maxf) represent the 
maximal and minimal eigenvalue of a matrix respectively. Set 
A = By definition the constraints (I47cl i and 

































( 147dl ) are satisfied by (A, Since B :>= 0 and A > 0, 

L L 

AB ^ ^ 'rp ^ ^ — Diagj^^iCi, * * * 5 

i=l i=l 

^ '^rnax(A)lj2i_^ A. (48) 

Thus constraint (I47bl l is also satisfied by {\,{pi}f^i). Set 
7 = A. Combination of A < 7 and the constraint (l47a 
guarantees that A and all ^^’s are bounded. So we con¬ 
clude that the level set {(A, {/ii}fLi)|A < 7, (A, £ 

dom(D3)} is nonempty and bounded. Invoking Weierstrass’ 
theorem(proposition 3.2.1-(2) in lIT^ '). (D3) is solvable. ■ 

C. Proof of Lemma |5] 


By Cauchy-Schwarz inequality the numerator f^Af of SNR 
is bounded as 


a^ff 

< 

^|ff(lK.®Hfg)| ' 

(59) 

L 



2 


El|f 

* 2 

\1 k, OHfg 2 


(60) 

(Sv 

'A™„(C,) *11 ‘ 

sIL 1 ’ 

(61) 


where the above first inequality uses Lemma [3 Combining 
the fact that f^Bf + Co > Co, the upper bound in the lemma 
is proved. ■ 


Proof: Recalling the definition of Ay in dlSbl i and 
utilizing the identity (AB) 0 (CD) = (A 0 C) (B 0 D) 
HD, we have 


Ay = (lic.l^J ® (Hfgg^H,) 

= (1k, O Hf g) (1^. O g^Hj). 

Then the j-th block column of A is given as 


Ay = 




Al 


'3 _ 


( 1^1 ® Hf g) (if^ O g^Hj) 


'-Kl 


’Hfg)(lf. Og^Hj) 


{Ik, ® Hf g) 


= a 1 


(lK.®Hfg) 
Jg^H,). 


{ll ® g^H,) 


(49) 


(50) 


(51) 


(52) 


The last equality utilizes the definition of a in (I28l l. Then the 
matrix A can be represented by packing all its column blocks 
as follows 


A=[A:1,.-- ,A.i] 

= a(l]^i Og^Hi),- 


= a 




T 
K 

g^Hi),... ,(lf^®g"Hi) 




g^Hi) 


= aa 


H 


(53) 

(54) 

(55) 

(56) 


The proof is complete. 

D. Proof of Lemma 0 

Proof: By the z-th power constraint ( I19bl ) we have 
A^„(C,)||f,||2<ffC,£, <Py 

which implies 


< 


P, 


^min (Q)' 


z = 1 , • • 


,T. 


(57) 


(58) 


E. Proof of Theorem 0 

Proof: Since each update of or g is obtained 

by solving a maximization problem, SNR monotonically in¬ 
creases. At the time we note that SNR is bounded. In fact 
since the SNR is invariant to scaling of g we can assume that 
||g ||2 = 1. According to dSST l in the proof of lemma |4l 
is bounded for all z = 1, • • • , L. Thus the numerator of SNR 
is bounded above and the denominator of SNR is bounded 
away from zero, so SNR should be bounded. Consequently 
the objective value sequence by algorithms [T] or |2] converges 
since it is monotonically increasing and bounded. 

Since are bounded, by Bolzano-Weierstrass the¬ 

orem II 20 I there exists a sequence {jk}^i such that 
converges. Since is updated by ((3) 

which is a continuous function of thus the se¬ 

quence {({Fp'“^}|7j^,g*^'^'“))also converges. The exis¬ 
tence of limit points of the solution sequence is proved. 

The feasible region of (PO) is a Cartesian product Xi x X 2 
with Xi = | (l7bl i is satisfied for z = 1, • • • ,L} and 

X 2 = C^^^\{0}. Corollary 2 in lfT9l states that any limit 
point of solution sequence generated by 2-block coordinate 
ascent method is stationary. It should be noted that this 
conclusion is obtained under the assumption that the objective 
function is continuously differentialbe on feasible region and 
each block feasible region(each term in the Cartesian product) 
is nonempty, closed and convex set. Unfortunately the problem 
(PO) does not satisfy this assumption since X 2 is nonconvex 
and not closed. In the following we will show that conclusion 
in m still applies to our problem after appropriately adjusting 
its argument. 

First we assert that the solution sequence always has 
nonzero g, i.e. 0 for all fc = 0,1, • • •. Since algorithms 

[T] or |2] starts from ({f|°^}( 7^,g(°)) with g(°) f 0, the asser- 
tion holds for k = 0 and SNR({f(°^}( 7]^,g'-^^) > 0. Assume 
that m > 1 is the smallest integer such that g^™) = 0, then 
according to ([S]) (X^fci HiF-"*^lifJ = 0. Notice ^ 

0. By (|6]l this readily implies SNR({F(™^}f7]^,g(™“^)) = 
0 < SNR({f)°^}(7j, g^°^), which contradicts the increasing 
monotonicity of SNR. 

























Next we assert that any limit point ({Fi}|li,g) of so¬ 
lution sequence has nonzero g. By contradiction we as¬ 
sume that the subsequence { con¬ 
verges to ({Fijl^j^jg) with g = 0. Then by ® 

{ Sf — 7 ^ 0. By rescaling each to g('^'=) 

such that ||g ^^'‘^||2 = 1 for all k — 1 , 2 ,-we actually 
construct another solution sequence which is also generated 
by 2 -block coordinate ascent method, since scaling of g does 
not change the SNR value. Now for this new solution sequence 


while (consequently the denominator of SNR) is 

bounded away from zero, we have SNR({Fp''^}^^, —>• 

0, which again contradicts the increasing monotonicity of SNR 
sequence. 

In 1191 the closedness assumption of X 2 is implicitly 
invoked in its proposition 2 to ensure that any limit point of 
solution sequence is feasible. Through the above proof we can 
see that this result holds true thus proposition 2 in lfT9l applies 
to our problem. 

The convexity assumption of X 2 is explicitly utilized in lfT9l 
in its proof of proposition 3. Here we identify the notations 
i, Xi+i, Xi+i and w{k,i) used in the original proof of 
proposition 3 in ifT^ as 1, g, X 2 and g(*^)) 

respectively in our case. 

According to the proof in ED, we can find a descent 
direction = g — g*^^^ with g G X 2 and 

Vg(.,SNR({Ff+'’}ii,g«)^(g-g«) < 0 . (62) 

( (l62l l corresponds to the inequality Wi+lf{w{k,i))'^df_^_l < 
0 in the original proof of proposition 3 in CD, which lies 
under equation (11) and is not labeled with number). Then 
by Armijo-type line search we can update with g^^^ + 
Q! 2 *^d 2 ^\ where d^*^ G X 2 and G (0,1]. The convexity 
of X 2 in IIT 9 I guarantees that g^^^ + G X 2 . 

Now we show that the fact g^^^ + G X 2 still 

holds for our problem although our X 2 is nonconvex. By 
contradiction assume that g^^^ + ^ X 2 , i.e. 


g 


(k) 


(k)i(k) ^ 

a^2 = 0- 


(63) 


This is actually impossible. By substituting d^^^ = g — g^^^ 
into (l63t we have 


(af ^ = 


„(fc) 


g- 


(64) 


As a result of Armijo-type line search algorithm(refer to 
(3) and proposition 1 in llT9l ). — 1) G [0,1). If 

072*^ = 1, then g = 0, which contradicts the fact g G X 2 . 

If (a 2 ^^ ~ 1) < I 7 then g = This is also 

0^2 

impossible since SNR is invariant to scaling of g and thus 
Vg(fc)SNR({Fj^^^^^}|G^, g^^^)^(g —g^^^) = 0 , which contra¬ 
dicts the fact Thus the proposition 3 in m also stands 
for our problem. 

As a direct implication of proposition 2 and 3, the corollary 
2 in m holds true and thus any limit point provided by 
algorithm [T] or 121 is stationary point of (PO). ■ 


E Proof of Theorem |4| 

Proof: Since the problem (P 8 ^) is a quadratic problem 
with one quadratic constraint and is obviously strictly feasible, 
the result of Appendix B.l in ll22l is valid to invoke, which 
states that (P 8 ^) has the following relaxation 

(P9(,)min. Tr{[Q!Bi-Aii]X}-2Re{qf x}-|-(ad—c*), (65a) 

s.t. Tr{QX} - P, < 0, (65b) 

^ ^ PO. (65c) 

with opt(P9(j) =opt(P 8 ^). We replace the variables (X, x) 
in (P9^) by one matrix variable X and rewrite it into a SDP 
form 


(PlOj,) min. TrjPiX}, 

s.t. TrjPaX} < P„ 

TrjPaX} = 1, 

with the parameter matrices being defined as 


Pi= 


-qf 


-q^ 

Otdi Ci 


,P2= 


c, 

0^ 


,p.3= 


o 

0 ^ 


( 66 a) 

(66b) 

( 66 c) 

o' 

1 ■ 


Since Ci 0, the feasible set of (P10(,) is bounded. 
Obviously the objective of (PIOJ,) takes finite value over 
the feasible set, so (PIOJ,) is solvable by Weierstrasss the- 
orem(proposition 3.2.1-(1) in ifTSl l. 

Assume that X* is one optimal solution. Obviously X* is 
non-zero(otherwise constraint P 3 X* = 1 would fail). Since 
X* has dimension KiNi + 1 > 3, evoking theorem 2.2 of lfT4l . 
we can obtain a vector x such that Tr{PjX*} = Tr{PjXX^} 
for j — 1,2,3. Denote x = [xf’,52]^- Notice that X 2 is 
nonzero(otherwise the constraint TrjPaXX^} = |i 2 p = 1 
would fail). Define x = [x^/i 2 ,1]^, it is easy to check that 


/o(?i)=Tr{Pixx"}=Tr{PiXX^}=Tr{PiX*}=opt(P10(,); 

X 2 

fi (?i)=Tr{P 2 Xx"}=Tr{P 2 S£^}=Tr{P 2 X*} < P,. ( 68 ) 

X2 

where /o(-) denotes the objective function of (P 8 ^) and 
/i(x) = x^CiX. The above two equations imply that Sci/x 2 
is an optimal solution to (P 8 ^) since which gives optimal value 
opt(PlOQ) and is feasible. ■ 


G. Proof of Theorem |5| 

Proof: When Ki =■ 1, the covariance matrices Sg and 
become scalars and {cr^jEi respectively and we 

have 


A,, = Hf gg^H„ B, = gg^H„ 
A„- = 1^,. ® (Hf gg^HO = Hf g(ir - 


'-Ki 


q* = XI 


[EO 

3^1 


Ki 


'g"H,)f, 


(69a) 
(69b) 
g^H,), (69c) 

(69d) 












To simplify the following discussion, we introduce the nota¬ 
tions 

P,. 




Ki 




Hfg),P. = _2, 2- 


O'? + (Jt 


(70) 


Hfgg"H, = U 


Ui = 


IIHfgIh 


CASE (II): a > cr~^. To minimize g{Ti), ti should be along 
the direction of /3*. Depending on whether the zero point of 
the absolute term in (TTST i satisfies the power constraint, two 
subcases are examined: 


Then the problem (P8^) in (l40l i is expressed as 
(Pllj,) :mm. (aa?-l)ffHfgg^H,f,-2Re{/3,g"H,f,} 

+ {adi-Ci), (71a) 

s.t. ||f,f < A. (71b) 

The key observation is that the quadratic matrix Hf^gg^Hi 
in the objective function has rank one and thus an eigenvalue 
decomposition as follows 

g-H,Hfg 1 

0(Ari-i)x(tVi-l)J 

with U = [ui,U 2 ,--- ,U 7 Vi] being eigenvectors of 

Hfgg^H,. The first eigenvector Ui corresponds to the 
unique nonzero eigenvalue and the other eigenvectors span 
the null space of H^gg^Hi. In other words, we have 

Hfg 


i) If 


1/3*1 


|Hfg||2(c<af-l) 


rpr < the optimum is given 


/3* 


U^gUaaf-iy 




(79) 


1/3*1 


W'af shiacr^-l) 


> \/l\, the optimum is given 


^/P^|3l 

m ' 


(80) 


p(rn=(aa/-l)||Hfg||^ 


V^- 


1/3. 


l|Hfg||2(Q!Crf-l) 


+Ci 


CASE (III): a < (J~ . Still ti should be along the direction of 
/?*, and takes full power. At this time the optimum is literally 
identical with dSOl l in the above. 

Remember that optimal f* to (P8^) is obtained by f* = 
T* , the proof is complete. ■ 


1 HhF 


g||2 


ufHfg = 0, j = (73) 
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Since is an orthonormal basis, f^ can be represented 

as ii = Ur = ^2^=1 with vector r being the coordinates 
in terms of basis 

By (l73t the objective of (Pll^) is independent of {rj}^ 2 - 
To save power, we should set all {rj}:J22 zero, which means 
ii = Ti ■ Thus, the problem (Pll^) boils down to the 

following problem with respect to one complex scalar ri 

(P12^) : min. 5 (ri) = (acr2-l)||Hfg||^|rip (74a) 

Tl 

-2||Hfg||2Re{Ari} + (ad,-Q), 
s.t. Inp <P,. (74b) 

Based on the sign of (cr| — a), the problem (P12^) can be 
tackled in the following three cases: 

CASE (I): a = cr~'^- In this case, the objective function in 
(P12^) degenerates to an affine function 

gin) = -2cr,^|lHf g||2Re{/3iri} + {ad^ - a). (75) 


By the Cauchy-Schwarz inequality, the optimal and mini¬ 
mum objective is obtained as 




; p(ri*) = -2a2 ||H,g||2|/3i I ^/P^+{ad, - q) ; (76) 


1/3. 

When a y a~^ by denoting 

Q ={adi-Ci) -- 


1/3. P 


aaf - 1 ’ 

the objective function is equivalently written as 


p(n) = (aaM)||Hfg||^ 


n- 


l|Hfg||2(Q:cr2_l) 


(77) 
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